Acquiring Data for Textual Entailment Recognition
نویسنده
چکیده
Language resources are hardly ever large enough. Building language resources that can be used as a gold standard for semantic analysis requires effort and investment. We present a prototype for acquiring language resources by means of a language game which is a cheap but long-term method. Games employed to acquire language resources are not new. For example games with a purpose are used for collecting common sense knowledge. The game presented in this paper is a work in progress. It collects annotated pairs text–hypothesis suitable for recognizing textual entailment in Czech. The game narrative is based on Sherlock Holmes and dr. Watson dialogues. For generating the dialogue line we use rule-based approaches such as syntactic analysis, anaphora resolution, synonym and hypernym replacement, word order rearrangement and verb frame based inference. To generate natural sounding sentences we added a language model score (based on n-gram frequencies in a corpus).
منابع مشابه
Different Models and Approaches of Textual Entailment Recognition
Variability of semantic expression is a fundamental phenomenon of a natural language where same meaning can be expressed by different texts. The process of inferring a text from another is called textual entailment. Textual Entailment is useful in a wide range of applications, including question answering, summarization, text generation, and machine translation. The recognition of textual entai...
متن کاملUsing Ontology Alignment for the TAC RTE Challenge
In this paper we present the system that we used to participate to the TAC RTE challenge. The system relies on acquiring and aligning ontologies to recognize textual entailment. It automatically acquires an ontology representing the text fragment and another one representing the hypothesis, and then aligns the created ontologies. By learning from available textual entailment data, the system ca...
متن کاملConsidering Discourse References in Textual Entailment Annotation
In the 2009 Recognizing Textual Entailment challenge a Search Pilot task has been introduced, aimed at finding all the sentences in a corpus which entail a set of given hypotheses. The preparation of the data set for this task has provided an opportunity to better understand some phenomena concerning textual entailment recognition in a natural setting. This paper focuses on some problematic iss...
متن کاملChinese Textual Entailment Recognition Enhanced with Word Embedding
Textual entailment has been proposed as a unifying generic framework for modeling language variability and semantic inference in different Natural Language Processing (NLP) tasks. By evaluating on NTCIR-11 RITE3 Simplified Chinese subtask data set, this paper firstly demonstrates and compares the performance of Chinese textual entailment recognition models that combine different lexical, syntac...
متن کاملUsing Minimal Recursion Semantics for Entailment Recognition
This paper describes work on using Minimal Recursion Semantics (MRS) representations for the task of recognising textual entailment. I use entailment data from a SemEval-2010 shared task to develop and evaluate an entailment recognition heuristic. I compare my results to the shared task winner, and discuss differences in approaches. Finally, I run my system with multiple MRS representations per...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013